350 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
English french
Availability:
Freely Available
License:
<Not Specified>
Size:
1825077 sentencesProduction Status:
Existing-used
Use:
Statistical phrase alignment
Paper:
N/A
Documentation:
Europarl: A Parallel Corpus for Statistical Machine Translation, Philipp Koehn, MT Summit 2005
Written
Corpus,
Language Type:
Trilingual
Languages:
English Spanish french
Availability:
Freely Available
License:
<Not Specified>
Size:
470 MByte Production Status:
Existing-used
Use:
Corpus Creation/Annotation
Paper:
N/A
Documentation:
http://opus.lingfil.uu.se/Europarl.php
Written
Corpus,
Language Type:
Multilingual
Languages:
English Finnish German french
Availability:
Freely Available
License:
<Not Specified>
Size:
100,000 sentences in 4 languages Production Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English German Spanish french
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
Publicly available
Written
Corpus,
Language Type:
Multilingual
Languages:
English Greek Standard Arabic french
Availability:
Freely Available
License:
Creative Commons Attribution 3.0 Unported License
Size:
2 Production Status:
Existing-used
Use:
Summarisation
Paper:
N/A
Documentation:
http://multiling.iit.demokritos.gr/file/view/353/tac-2011-multiling-pilot-dataset-all-files-source-texts-human-and-system-summaries-evaluation-data
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
french
Availability:
From Data Center(s)
License:
Licence Creative Commons Attribution - Pas d'Utilisation Commerciale - Partage dans les Mêmes Conditions 4.0 International
Size:
None Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
-
Paper track:8.1 Feature extraction and low-level feature model/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Laurent Besacier | Enquête Socio-Linguistique à Orléans (ESLO) | /N |
Documentation:
http://eslo.huma-num.fr/index.php, https://journals.openedition.org/corpus/2036#tocto1n2, French, public
Written
Corpus,
Language Type:
Trilingual
Languages:
English french italian
Availability:
Freely Available
License:
Creative Commons
Size:
3194 sentences Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Exploiting catenae in a parallel treebank alignment
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Manuela Sanguinetti | University of Torino, Department of Computer Science | IT | ||
| Author 2 | Cristina Bosco | Università di Torino | IT | University of Torino, Department of Computer Science | IT |
| Author 3 | Loredana Cupi | University of Torino, Department of Foreign Languages and Literatures and Modern Cultures | IT | ||
| Main Contact | Manuela Sanguinetti | University of Torino, Department of Computer Science | None |
Documentation:
<Not Specified>
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
English french
Availability:
Freely Available
License:
CC BY 4.0
Size:
236 hours Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation
-
Paper track:Speech
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Ali Can Kocabiyikoglu | University of Grenoble Alpes | FR |
| Author 2 | Laurent Besacier | LIG | FR |
| Author 3 | Olivier Kraif | University of Grenoble Alpes | FR |
| Main Contact | Ali Can Kocabiyikoglu | University of Grenoble Alpes | None |
Documentation:
https://github.com/alicank/Translation-Augmented-LibriSpeech-Corpus
Written
Ontology,
Language Type:
Trilingual
Languages:
English Spanish french
Availability:
Freely Available
License:
Creative Commons Non Profit
Size:
2 GByte Production Status:
Newly created-finished
Use:
Semantic Web
-
Paper title:A disambiguation resource extracted from Wikipedia for semantic annotation
-
Paper track:Terminology
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Eric Charton | <Not Specified> | None | ||
| Author 2 | Michel Gagnon | <Not Specified> | None | École Polytechnique de Montréal | None |
| Main Contact | Eric Charton | École Polytechnique de Montréal | CA |
Documentation:
On website
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
french
Availability:
From Data Center(s)
License:
Size:
7.2 hours Production Status:
Newly created-finished
Use:
Perception of human-human and human-machine conversation
-
Paper title:The Brain-IHM Dataset: a New Resource for Studying the Brain Basis of Human-Human and Human-Machine Conversations
-
Paper track:Multimodality/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Magalie Ochs | BrainIHM Corpus | /N |
Documentation:
None




